Search | VHL Regional Portal

1.

CyVerse: Cyberinfrastructure for open science.

Swetnam, Tyson L; Antin, Parker B; Bartelme, Ryan; Bucksch, Alexander; Camhy, David; Chism, Greg; Choi, Illyoung; Cooksey, Amanda M; Cosi, Michele; Cowen, Cindy; Culshaw-Maurer, Michael; Davey, Robert; Davey, Sean; Devisetty, Upendra; Edgin, Tony; Edmonds, Andy; Fedorov, Dmitry; Frady, Jeremy; Fonner, John; Gillan, Jeffrey K; Hossain, Iqbal; Joyce, Blake; Lang, Konrad; Lee, Tina; Littin, Shelley; McEwen, Ian; Merchant, Nirav; Micklos, David; Nelson, Andrew; Ramsey, Ashley; Roberts, Sarah; Sarando, Paul; Skidmore, Edwin; Song, Jawon; Sprinkle, Mary Margaret; Srinivasan, Sriram; Stanzione, Dan; Strootman, Jonathan D; Stryeck, Sarah; Tuteja, Reetu; Vaughn, Matthew; Wali, Mojib; Wall, Mariah; Walls, Ramona; Wang, Liya; Wickizer, Todd; Williams, Jason; Wregglesworth, John; Lyons, Eric.

PLoS Comput Biol ; 20(2): e1011270, 2024 Feb.

Article in English | MEDLINE | ID: mdl-38324613

ABSTRACT

CyVerse, the largest publicly-funded open-source research cyberinfrastructure for life sciences, has played a crucial role in advancing data-driven research since the 2010s. As the technology landscape evolved with the emergence of cloud computing platforms, machine learning and artificial intelligence (AI) applications, CyVerse has enabled access by providing interfaces, Software as a Service (SaaS), and cloud-native Infrastructure as Code (IaC) to leverage new technologies. CyVerse services enable researchers to integrate institutional and private computational resources, custom software, perform analyses, and publish data in accordance with open science principles. Over the past 13 years, CyVerse has registered more than 124,000 verified accounts from 160 countries and was used for over 1,600 peer-reviewed publications. Since 2011, 45,000 students and researchers have been trained to use CyVerse. The platform has been replicated and deployed in three countries outside the US, with additional private deployments on commercial clouds for US government agencies and multinational corporations. In this manuscript, we present a strategic blueprint for creating and managing SaaS cyberinfrastructure and IaC as free and open-source software.

Subject(s)

Artificial Intelligence , Software , Humans , Cloud Computing , Publishing

2.

Resilience Informatics for Public Health.

Iyengar, M Sriram; Merchant, Nirav; Ernst, Kacey; Rains, Stephen; Arora, Mona; Ngaybe, Maiya G Block; Gonzalez, Myla.

Stud Health Technol Inform ; 310: 1276-1280, 2024 Jan 25.

Article in English | MEDLINE | ID: mdl-38270020

ABSTRACT

Resilience research is attracting increasing attention as stressors such as pandemics and climate change impact normal life worldwide. Informatics tools can play an important role in enhancing resilience of people, communities, and organizations. We present Resilience Informatics as a sub-discipline of resilience research and propose a conceptual framework for Resilience Informatics to aid in the development and effective deployment of informatics systems for resilience.

Subject(s)

Public Health , Resilience, Psychological , Humans , Climate Change , Informatics , Pandemics

3.

Cyber-agricultural systems for crop breeding and sustainable production.

Sarkar, Soumik; Ganapathysubramanian, Baskar; Singh, Arti; Fotouhi, Fateme; Kar, Soumyashree; Nagasubramanian, Koushik; Chowdhary, Girish; Das, Sajal K; Kantor, George; Krishnamurthy, Adarsh; Merchant, Nirav; Singh, Asheesh K.

Trends Plant Sci ; 29(2): 130-149, 2024 02.

Article in English | MEDLINE | ID: mdl-37648631

ABSTRACT

The cyber-agricultural system (CAS) represents an overarching framework of agriculture that leverages recent advances in ubiquitous sensing, artificial intelligence, smart actuators, and scalable cyberinfrastructure (CI) in both breeding and production agriculture. We discuss the recent progress and perspective of the three fundamental components of CAS - sensing, modeling, and actuation - and the emerging concept of agricultural digital twins (DTs). We also discuss how scalable CI is becoming a key enabler of smart agriculture. In this review we shed light on the significance of CAS in revolutionizing crop breeding and production by enhancing efficiency, productivity, sustainability, and resilience to changing climate. Finally, we identify underexplored and promising future directions for CAS research and development.

Subject(s)

Agriculture , Artificial Intelligence , Plant Breeding

4.

PhytoOracle: Scalable, modular phenomics data processing pipelines.

Gonzalez, Emmanuel M; Zarei, Ariyan; Hendler, Nathanial; Simmons, Travis; Zarei, Arman; Demieville, Jeffrey; Strand, Robert; Rozzi, Bruno; Calleja, Sebastian; Ellingson, Holly; Cosi, Michele; Davey, Sean; Lavelle, Dean O; Truco, Maria José; Swetnam, Tyson L; Merchant, Nirav; Michelmore, Richard W; Lyons, Eric; Pauli, Duke.

Front Plant Sci ; 14: 1112973, 2023.

Article in English | MEDLINE | ID: mdl-36950362

ABSTRACT

As phenomics data volume and dimensionality increase due to advancements in sensor technology, there is an urgent need to develop and implement scalable data processing pipelines. Current phenomics data processing pipelines lack modularity, extensibility, and processing distribution across sensor modalities and phenotyping platforms. To address these challenges, we developed PhytoOracle (PO), a suite of modular, scalable pipelines for processing large volumes of field phenomics RGB, thermal, PSII chlorophyll fluorescence 2D images, and 3D point clouds. PhytoOracle aims to (i) improve data processing efficiency; (ii) provide an extensible, reproducible computing framework; and (iii) enable data fusion of multi-modal phenomics data. PhytoOracle integrates open-source distributed computing frameworks for parallel processing on high-performance computing, cloud, and local computing environments. Each pipeline component is available as a standalone container, providing transferability, extensibility, and reproducibility. The PO pipeline extracts and associates individual plant traits across sensor modalities and collection time points, representing a unique multi-system approach to addressing the genotype-phenotype gap. To date, PO supports lettuce and sorghum phenotypic trait extraction, with a goal of widening the range of supported species in the future. At the maximum number of cores tested in this study (1,024 cores), PO processing times were: 235 minutes for 9,270 RGB images (140.7 GB), 235 minutes for 9,270 thermal images (5.4 GB), and 13 minutes for 39,678 PSII images (86.2 GB). These processing times represent end-to-end processing, from raw data to fully processed numerical phenotypic trait data. Repeatability values of 0.39-0.95 (bounding area), 0.81-0.95 (axis-aligned bounding volume), 0.79-0.94 (oriented bounding volume), 0.83-0.95 (plant height), and 0.81-0.95 (number of points) were observed in Field Scanalyzer data. We also show the ability of PO to process drone data with a repeatability of 0.55-0.95 (bounding area).

5.

Interpretation and Use of Applied/Operational Machine Learning and Artificial Intelligence in Surgery.

Douglas, Molly J; Callcut, Rachel; Celi, Leo Anthony; Merchant, Nirav.

Surg Clin North Am ; 103(2): 317-333, 2023 Apr.

Article in English | MEDLINE | ID: mdl-36948721

ABSTRACT

Applications for artificial intelligence (AI) and machine learning in surgery include image interpretation, data summarization, automated narrative construction, trajectory and risk prediction, and operative navigation and robotics. The pace of development has been exponential, and some AI applications are working well. However, demonstrations of clinical utility, validity, and equity have lagged algorithm development and limited widespread adoption of AI into clinical practice. Outdated computing infrastructure and regulatory challenges which promote data silos are key barriers. Multidisciplinary teams will be needed to address these challenges and to build AI systems that are relevant, equitable, and dynamic.

Subject(s)

Artificial Intelligence , Robotics , Humans , Machine Learning , Algorithms

6.

Design and implementation of a health messaging protocol employed for use within a COVID-19 health dissemination platform.

Colombo, Paulina M; Freylersythe, Sarah; Sprinkle, Mary Margaret; Ernst, Kacey C; Yubeta, Marcela; Barbati, Juliana L; Merchant, Nirav; Iyengar, Sriram; Crane, Tracy E; Oxnam, Maliaca; Rains, Stephen A.

Front Public Health ; 10: 942795, 2022.

Article in English | MEDLINE | ID: mdl-36504998

ABSTRACT

Introduction: AZCOVIDTXT, a bilingual, two-way information sharing platform was created in April of 2020 in response to rising COVID-19 cases in Arizona. The aim of this paper is to delineate the protocol and processes used to develop and disseminate health messaging to serve as guidance for other groups, universities, or public health programs in the implementation or enhancement of health communication services. Methods: Health messaging formats included website articles, published on the system's website (azcovidtxt.org), infographics posted on social media, and SMS. Social media and SMS infographics were intended to highlight and augment the topics covered in the weekly website articles, to create a seamless multimodal source of reliable COVID-19 information for AZCOVIDTXT enrollees and the broader public. All health messaging information, text message and social media content was planned and reviewed collaboratively by the AZCOVIDTXT team topic experts for accuracy, efficacy, and content consistency. Results: As of July 2021, AZCOVIDTXT provided weekly COVID-19-related health communication to 3,747 participating households located across 225 Arizona zip codes. AZCOVIDTXT has developed and sent 446 unique, bilingual SMS for a total of 271,977 contact points. The team has produced and published 179 website articles, which averaged a combined 7,000-page views per month, and 173 social media posts were made available to 268 followers across three platforms. Discussion: Several programmatic aspects were deemed essential to the success of AZCOVIDTXT. These included (1) addressing community specific needs, (2) creating timely and relevant content, (3) developing an adaptable system, and (4) prioritizing system automation where possible, (5) having an interdisciplinary team approach to identifying and crafting key messages.

Subject(s)

COVID-19 , Social Media , Humans , COVID-19/epidemiology , Public Health , Information Dissemination , Universities

7.

A local platform for user-friendly FAIR data management and reproducible analytics.

Wieser, Florian; Stryeck, Sarah; Lang, Konrad; Hahn, Christoph; Thallinger, Gerhard G; Feichtinger, Julia; Hack, Philipp; Stepponat, Manfred; Merchant, Nirav; Lindstaedt, Stefanie; Oberdorfer, Gustav.

J Biotechnol ; 341: 43-50, 2021 Nov 20.

Article in English | MEDLINE | ID: mdl-34400238

ABSTRACT

Collaborative research is common practice in modern life sciences. For most projects several researchers from multiple universities collaborate on a specific topic. Frequently, these research projects produce a wealth of data that requires central and secure storage, which should also allow for easy sharing among project participants. Only under best circumstances, this comes with minimal technical overhead for the researchers. Moreover, the need for data to be analyzed in a reproducible way often poses a challenge for researchers without a data science background and thus represents an overly time-consuming process. Here, we report on the integration of CyVerse Austria (CAT), a new cyberinfrastructure for a local community of life science researchers, and provide two examples how it can be used to facilitate FAIR data management and reproducible analytics for teaching and research. In particular, we describe in detail how CAT can be used (i) as a teaching platform with a defined software environment and data management/sharing possibilities, and (ii) to build a data analysis pipeline using the Docker technology tailored to the needs and interests of the researcher.

Subject(s)

Data Management , Software , Austria

8.

UAS-Based Plant Phenotyping for Research and Breeding Applications.

Guo, Wei; Carroll, Matthew E; Singh, Arti; Swetnam, Tyson L; Merchant, Nirav; Sarkar, Soumik; Singh, Asheesh K; Ganapathysubramanian, Baskar.

Plant Phenomics ; 2021: 9840192, 2021.

Article in English | MEDLINE | ID: mdl-34195621

ABSTRACT

Unmanned aircraft system (UAS) is a particularly powerful tool for plant phenotyping, due to reasonable cost of procurement and deployment, ease and flexibility for control and operation, ability to reconfigure sensor payloads to diversify sensing, and the ability to seamlessly fit into a larger connected phenotyping network. These advantages have expanded the use of UAS-based plant phenotyping approach in research and breeding applications. This paper reviews the state of the art in the deployment, collection, curation, storage, and analysis of data from UAS-based phenotyping platforms. We discuss pressing technical challenges, identify future trends in UAS-based phenotyping that the plant research community should be aware of, and pinpoint key plant science and agronomic questions that can be resolved with the next generation of UAS-based imaging modalities and associated data analysis pipelines. This review provides a broad account of the state of the art in UAS-based phenotyping to reduce the barrier to entry to plant science practitioners interested in deploying this imaging modality for phenotyping in plant breeding and research areas.

9.

SARS-CoV-2 Rapid Antigen Testing of Symptomatic and Asymptomatic Individuals on the University of Arizona Campus.

Harris, David T; Badowski, Michael; Jernigan, Brandon; Sprissler, Ryan; Edwards, Taylor; Cohen, Randall; Paul, Stephen; Merchant, Nirav; Weinkauf, Craig C; Bime, Christian; Erickson, Heidi E; Bixby, Billie; Parthasarathy, Sairam; Chaudhary, Sachin; Natt, Bhupinder; Cristan, Elaine; El Aini, Tammer; Rischard, Franz; Campion, Janet; Chopra, Madhav; Insel, Michael; Sam, Afshin; Knepler, James L; Knox, Kenneth; Mosier, Jarrod; Spier, Catherine; Dake, Michael D.

Biomedicines ; 9(5)2021 May 12.

Article in English | MEDLINE | ID: mdl-34066047

ABSTRACT

SARS-CoV-2, the cause of COVID19, has caused a pandemic that has infected more than 80 M and killed more than 1.6 M persons worldwide. In the US as of December 2020, it has infected more than 32 M people while causing more than 570,000 deaths. As the pandemic persists, there has been a public demand to reopen schools and university campuses. To consider these demands, it is necessary to rapidly identify those individuals infected with the virus and isolate them so that disease transmission can be stopped. In the present study, we examined the sensitivity of the Quidel Rapid Antigen test for use in screening both symptomatic and asymptomatic individuals at the University of Arizona from June to August 2020. A total of 885 symptomatic and 1551 asymptomatic subjects were assessed by antigen testing and real-time PCR testing. The sensitivity of the test for both symptomatic and asymptomatic persons was between 82 and 90%, with some caveats.

10.

Ten simple rules to cultivate transdisciplinary collaboration in data science.

Sahneh, Faryad; Balk, Meghan A; Kisley, Marina; Chan, Chi-Kwan; Fox, Mercury; Nord, Brian; Lyons, Eric; Swetnam, Tyson; Huppenkothen, Daniela; Sutherland, Will; Walls, Ramona L; Quinn, Daven P; Tarin, Tonantzin; LeBauer, David; Ribes, David; Birnie, Dunbar P; Lushbough, Carol; Carr, Eric; Nearing, Grey; Fischer, Jeremy; Tyle, Kevin; Carrasco, Luis; Lang, Meagan; Rose, Peter W; Rushforth, Richard R; Roy, Samapriya; Matheson, Thomas; Lee, Tina; Brown, C Titus; Teal, Tracy K; PapeÈ, Monica; Kobourov, Stephen; Merchant, Nirav.

PLoS Comput Biol ; 17(5): e1008879, 2021 05.

Article in English | MEDLINE | ID: mdl-33983959

Subject(s)

Cooperative Behavior , Data Science , Interdisciplinary Communication , Interprofessional Relations , Computational Biology , Data Science/ethics , Data Science/organization & administration , Data Science/trends , Humans , Intersectoral Collaboration

11.

iVirus 2.0: Cyberinfrastructure-supported tools and data to power DNA virus ecology.

Bolduc, Benjamin; Zablocki, Olivier; Guo, Jiarong; Zayed, Ahmed A; Vik, Dean; Dehal, Paramvir; Wood-Charlson, Elisha M; Arkin, Adam; Merchant, Nirav; Pett-Ridge, Jennifer; Roux, Simon; Vaughn, Matthew; Sullivan, Matthew B.

ISME Commun ; 1(1): 77, 2021 Dec 14.

Article in English | MEDLINE | ID: mdl-36765102

ABSTRACT

Microbes drive myriad ecosystem processes, but under strong influence from viruses. Because studying viruses in complex systems requires different tools than those for microbes, they remain underexplored. To combat this, we previously aggregated double-stranded DNA (dsDNA) virus analysis capabilities and resources into 'iVirus' on the CyVerse collaborative cyberinfrastructure. Here we substantially expand iVirus's functionality and accessibility, to iVirus 2.0, as follows. First, core iVirus apps were integrated into the Department of Energy's Systems Biology KnowledgeBase (KBase) to provide an additional analytical platform. Second, at CyVerse, 20 software tools (apps) were upgraded or added as new tools and capabilities. Third, nearly 20-fold more sequence reads were aggregated to capture new data and environments. Finally, documentation, as "live" protocols, was updated to maximize user interaction with and contribution to infrastructure development. Together, iVirus 2.0 serves as a uniquely central and accessible analytical platform for studying how viruses, particularly dsDNA viruses, impact diverse microbial ecosystems.

12.

Community-Level Health Promotion during a Pandemic: Key Considerations for Health Communication.

Rains, Stephen A; Crane, Tracy E; Iyengar, Sriram; Merchant, Nirav; Oxnam, Maliaca; Sprinkle, Mary Margaret; Ernst, Kacey C.

Health Commun ; 35(14): 1747-1749, 2020 12.

Article in English | MEDLINE | ID: mdl-33084404

Subject(s)

Health Communication , Influenza, Human , Communication , Health Promotion , Humans , Influenza, Human/epidemiology , Influenza, Human/prevention & control , Pandemics/prevention & control

13.

Ten simple rules for organizing a data science workshop.

Ponsero, Alise; Bartelme, Ryan; de Oliveira Almeida, Gustavo; Bigelow, Alex; Tuteja, Reetu; Ellingson, Holly; Swetnam, Tyson; Merchant, Nirav; Oxnam, Maliaca; Lyons, Eric.

PLoS Comput Biol ; 16(10): e1008226, 2020 10.

Article in English | MEDLINE | ID: mdl-33090991

Subject(s)

Data Science , Education, Continuing , Computer Literacy , Data Science/education , Data Science/organization & administration , Education, Continuing/methods , Education, Continuing/organization & administration , Humans

14.

The principles of tomorrow's university.

Katz, Daniel S; Allen, Gabrielle; Barba, Lorena A; Berg, Devin R; Bik, Holly; Boettiger, Carl; Borgman, Christine L; Brown, C Titus; Buck, Stuart; Burd, Randy; de Waard, Anita; Eve, Martin Paul; Granger, Brian E; Greenberg, Josh; Howe, Adina; Howe, Bill; Khanna, May; Killeen, Timothy L; Mayernik, Matthew; McKiernan, Erin; Mentzel, Chris; Merchant, Nirav; Niemeyer, Kyle E; Noren, Laura; Nusser, Sarah M; Reed, Daniel A; Seidel, Edward; Smith, MacKenzie; Spies, Jeffrey R; Turk, Matt; Van Horn, John D; Walsh, Jay.

F1000Res ; 7: 1926, 2018.

Article in English | MEDLINE | ID: mdl-30687499

ABSTRACT

In the 21st Century, research is increasingly data- and computation-driven. Researchers, funders, and the larger community today emphasize the traits of openness and reproducibility. In March 2017, 13 mostly early-career research leaders who are building their careers around these traits came together with ten university leaders (presidents, vice presidents, and vice provosts), representatives from four funding agencies, and eleven organizers and other stakeholders in an NIH- and NSF-funded one-day, invitation-only workshop titled "Imagining Tomorrow's University." Workshop attendees were charged with launching a new dialog around open research - the current status, opportunities for advancement, and challenges that limit sharing. The workshop examined how the internet-enabled research world has changed, and how universities need to change to adapt commensurately, aiming to understand how universities can and should make themselves competitive and attract the best students, staff, and faculty in this new world. During the workshop, the participants re-imagined scholarship, education, and institutions for an open, networked era, to uncover new opportunities for universities to create value and serve society. They expressed the results of these deliberations as a set of 22 principles of tomorrow's university across six areas: credit and attribution, communities, outreach and engagement, education, preservation and reproducibility, and technologies. Activities that follow on from workshop results take one of three forms. First, since the workshop, a number of workshop authors have further developed and published their white papers to make their reflections and recommendations more concrete. These authors are also conducting efforts to implement these ideas, and to make changes in the university system. Second, we plan to organise a follow-up workshop that focuses on how these principles could be implemented. Third, we believe that the outcomes of this workshop support and are connected with recent theoretical work on the position and future of open knowledge institutions.

Subject(s)

Universities , Career Choice , Community Participation , Community-Institutional Relations , Education , Humans , Information Technology , Research

15.

Overcoming the Law of the Hidden in Cyberinfrastructures.

Bucksch, Alexander; Das, Abhiram; Schneider, Hannah; Merchant, Nirav; Weitz, Joshua S.

Trends Plant Sci ; 22(2): 117-123, 2017 02.

Article in English | MEDLINE | ID: mdl-28027865

ABSTRACT

Cyberinfrastructure projects (CIPs) are complex, integrated systems that require interaction and organization amongst user, developer, hardware, technical infrastructure, and funding resources. Nevertheless, CIP usability, functionality, and growth do not scale with the sum of these resources. Instead, growth and efficient usage of CIPs require access to 'hidden' resources. These include technical resources within CIPs as well as social and functional interactions among stakeholders. We identify approaches to overcome resource limitations following the conceptual basis of Liebig's Law of the Minimum. In so doing, we recommend practical steps towards efficient and scaleable resource use, taking the iPlant/CyVerse CIP as an example.

Subject(s)

Computational Biology/methods , Plants , Models, Biological , Software

16.

Bringing your tools to CyVerse Discovery Environment using Docker.

Devisetty, Upendra Kumar; Kennedy, Kathleen; Sarando, Paul; Merchant, Nirav; Lyons, Eric.

F1000Res ; 5: 1442, 2016.

Article in English | MEDLINE | ID: mdl-27803802

ABSTRACT

Docker has become a very popular container-based virtualization platform for software distribution that has revolutionized the way in which scientific software and software dependencies (software stacks) can be packaged, distributed, and deployed. Docker makes the complex and time-consuming installation procedures needed for scientific software a one-time process. Because it enables platform-independent installation, versioning of software environments, and easy redeployment and reproducibility, Docker is an ideal candidate for the deployment of identical software stacks on different compute environments such as XSEDE and Amazon AWS. CyVerse's Discovery Environment also uses Docker for integrating its powerful, community-recommended software tools into CyVerse's production environment for public use. This paper will help users bring their tools into CyVerse Discovery Environment (DE) which will not only allows users to integrate their tools with relative ease compared to the earlier method of tool deployment in DE but will also help users to share their apps with collaborators and release them for public use.

17.

PGen: large-scale genomic variations analysis workflow and browser in SoyKB.

Liu, Yang; Khan, Saad M; Wang, Juexin; Rynge, Mats; Zhang, Yuanxun; Zeng, Shuai; Chen, Shiyuan; Maldonado Dos Santos, Joao V; Valliyodan, Babu; Calyam, Prasad P; Merchant, Nirav; Nguyen, Henry T; Xu, Dong; Joshi, Trupti.

BMC Bioinformatics ; 17(Suppl 13): 337, 2016 Oct 06.

Article in English | MEDLINE | ID: mdl-27766951

ABSTRACT

BACKGROUND: With the advances in next-generation sequencing (NGS) technology and significant reductions in sequencing costs, it is now possible to sequence large collections of germplasm in crops for detecting genome-scale genetic variations and to apply the knowledge towards improvements in traits. To efficiently facilitate large-scale NGS resequencing data analysis of genomic variations, we have developed "PGen", an integrated and optimized workflow using the Extreme Science and Engineering Discovery Environment (XSEDE) high-performance computing (HPC) virtual system, iPlant cloud data storage resources and Pegasus workflow management system (Pegasus-WMS). The workflow allows users to identify single nucleotide polymorphisms (SNPs) and insertion-deletions (indels), perform SNP annotations and conduct copy number variation analyses on multiple resequencing datasets in a user-friendly and seamless way. RESULTS: We have developed both a Linux version in GitHub ( https://github.com/pegasus-isi/PGen-GenomicVariations-Workflow ) and a web-based implementation of the PGen workflow integrated within the Soybean Knowledge Base (SoyKB), ( http://soykb.org/Pegasus/index.php ). Using PGen, we identified 10,218,140 single-nucleotide polymorphisms (SNPs) and 1,398,982 indels from analysis of 106 soybean lines sequenced at 15X coverage. 297,245 non-synonymous SNPs and 3330 copy number variation (CNV) regions were identified from this analysis. SNPs identified using PGen from additional soybean resequencing projects adding to 500+ soybean germplasm lines in total have been integrated. These SNPs are being utilized for trait improvement using genotype to phenotype prediction approaches developed in-house. In order to browse and access NGS data easily, we have also developed an NGS resequencing data browser ( http://soykb.org/NGS_Resequence/NGS_index.php ) within SoyKB to provide easy access to SNP and downstream analysis results for soybean researchers. CONCLUSION: PGen workflow has been optimized for the most efficient analysis of soybean data using thorough testing and validation. This research serves as an example of best practices for development of genomics data analysis workflows by integrating remote HPC resources and efficient data management with ease of use for biological users. PGen workflow can also be easily customized for analysis of data in other species.

Subject(s)

Genome, Plant , Glycine max/genetics , Polymorphism, Genetic , Sequence Analysis, DNA/methods , Software , Genomics/methods , High-Throughput Nucleotide Sequencing/methods , Workflow

18.

xGDBvm: A Web GUI-Driven Workflow for Annotating Eukaryotic Genomes in the Cloud.

Duvick, Jon; Standage, Daniel S; Merchant, Nirav; Brendel, Volker P.

Plant Cell ; 28(4): 840-54, 2016 04.

Article in English | MEDLINE | ID: mdl-27020957

ABSTRACT

Genome-wide annotation of gene structure requires the integration of numerous computational steps. Currently, annotation is arguably best accomplished through collaboration of bioinformatics and domain experts, with broad community involvement. However, such a collaborative approach is not scalable at today's pace of sequence generation. To address this problem, we developed the xGDBvm software, which uses an intuitive graphical user interface to access a number of common genome analysis and gene structure tools, preconfigured in a self-contained virtual machine image. Once their virtual machine instance is deployed through iPlant's Atmosphere cloud services, users access the xGDBvm workflow via a unified Web interface to manage inputs, set program parameters, configure links to high-performance computing (HPC) resources, view and manage output, apply analysis and editing tools, or access contextual help. The xGDBvm workflow will mask the genome, compute spliced alignments from transcript and/or protein inputs (locally or on a remote HPC cluster), predict gene structures and gene structure quality, and display output in a public or private genome browser complete with accessory tools. Problematic gene predictions are flagged and can be reannotated using the integrated yrGATE annotation tool. xGDBvm can also be configured to append or replace existing data or load precomputed data. Multiple genomes can be annotated and displayed, and outputs can be archived for sharing or backup. xGDBvm can be adapted to a variety of use cases including de novo genome annotation, reannotation, comparison of different annotations, and training or teaching.

Subject(s)

Software , Computational Biology , Genome, Plant/genetics , Workflow

19.

The iPlant Collaborative: Cyberinfrastructure for Enabling Data to Discovery for the Life Sciences.

Merchant, Nirav; Lyons, Eric; Goff, Stephen; Vaughn, Matthew; Ware, Doreen; Micklos, David; Antin, Parker.

PLoS Biol ; 14(1): e1002342, 2016 Jan.

Article in English | MEDLINE | ID: mdl-26752627

ABSTRACT

The iPlant Collaborative provides life science research communities access to comprehensive, scalable, and cohesive computational infrastructure for data management; identity management; collaboration tools; and cloud, high-performance, high-throughput computing. iPlant provides training, learning material, and best practice resources to help all researchers make the best use of their data, expand their computational skill set, and effectively manage their data and computation when working as distributed teams. iPlant's platform permits researchers to easily deposit and share their data and deploy new computational tools and analysis workflows, allowing the broader community to easily use and reuse those data and computational analyses.

Subject(s)

Computational Biology/organization & administration , Internet , Software

20.

The EPIC Kids Study: a randomized family-focused YMCA-based intervention to prevent type 2 diabetes in at-risk youth.

Hingle, Melanie D; Turner, Tami; Kutob, Randa; Merchant, Nirav; Roe, Denise J; Stump, Craig; Going, Scott B.

BMC Public Health ; 15: 1253, 2015 Dec 18.

Article in English | MEDLINE | ID: mdl-26679186

ABSTRACT

BACKGROUND: It is well established that behavioral lifestyle interventions resulting in modest weight reduction in adults can prevent or delay type 2 diabetes mellitus; however in children, successful weight management interventions are rarely found outside of controlled clinical settings. The lack of effective community-based programs is a barrier to reducing obesity prevalence and diabetes risk in children. The objective of our study is to develop and test a group-randomized family-centered community-based type 2 diabetes prevention intervention targeting at-risk children, 9- to 12-years-old. METHODS/DESIGN: Using participatory methods, the adult-focused YMCA Diabetes Prevention Program was adapted for families, creating a novel lifestyle behavior change program focused on healthy eating, physical activity, and a supportive home environment. The program will be tested in sixty 9- to 12-year-old children at risk of diabetes and sixty parents over 12 consecutive weeks with two intervention formats randomized by location: a face-to-face instructor-led program, or a hybrid program with alternating face-to-face and mobile technology-delivered content. Anthropometric, behavioral, psychosocial and physiological outcomes will be assessed at baseline, post-intervention (12 weeks), and follow-up (24 weeks). Secondary outcomes are participant acceptability, feasibility, and adherence. The RE-AIM framework (reach, efficacy, adoption, implementation, and maintenance) will guide intervention implementation and evaluation. Changes at 12 weeks will be assessed using a paired t-test combining both delivery formats. Exploratory models using linear regression analysis will estimate the magnitude of the difference between the face-to-face and hybrid format. The sample size of 60 children, informed by a previous YMCA intervention in which -4.3 % change in overweight (SE = 1.1) was observed over 6 months, will give us 80 % power to detect an effect size of this magnitude, assuming a one-sided test at alpha = 0.05. DISCUSSION: The proposed study capitalizes on a partnership with the YMCA, a popular and widespread community organization, and uses mobile technologies to extend program reach while potentially reducing burden associated with weekly attendance. The long-term goal is to create a scalable, replicable, and sustainable pediatric "diabesity" prevention program that overcomes existing barriers to the translation of efficacious interventions into effective community programs. TRIAL REGISTRATION: ClinicalTrials.gov, NCT02421198 on April 15, 2015.

Subject(s)

Diabetes Mellitus, Type 2/prevention & control , Family , Health Promotion/organization & administration , Behavior Therapy , Child , Female , Humans , Life Style , Male , Obesity/prevention & control , Overweight , Program Evaluation , Research Design , Residence Characteristics , Risk Factors

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL